Score tests for zero-inflation and overdispersion in two-level count data

Authors

  • Hwa Kyung Lim
  • Juwon Song
  • Byoung Cheol Jung
Abstract

In a Poisson regression model where observations are either clustered or represented by repeated measurements of counts, the number of observed zero counts is sometimes greater than the frequency expected under the Poisson distribution, and the non-zero part of the count data may be overdispersed. The zero-inflated negative binomial (ZINB) mixed regression model is suggested for analyzing such data. Previous studies have proposed score statistics for testing zero-inflation and overdispersion separately in correlated count data. Here, we also deal with simultaneous score tests for zero-inflation and overdispersion in two-level count data by using the ZINB mixed regression model. Score tests are suggested for 1) zero-inflation in the presence of overdispersion, 2) overdispersion in the presence of zero-inflation, and 3) zero-inflation and overdispersion simultaneously. The level and power of the score test statistics are evaluated by a simulation study. The simulation results indicate that the empirical level of the score tests may occasionally fall below or exceed the nominal significance level due to variations in random effects. This study proposes a parametric bootstrap method to overcome this problem. The simulation results of the bootstrap test indicate that the score tests hold the nominal level and provide good power.

Keywords: Zero-Inflation, Overdispersion, Generalized Linear Mixed Models, Zero-Inflated Negative Binomial, Score Test, Bootstrap
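The abstract gives no formulas, so the following is only a minimal sketch of the two ideas it names, under loudly stated assumptions. `zinb_pmf` uses a common ZINB parameterization (mean `mu`, dispersion `alpha` with variance `mu + alpha * mu**2` in the negative binomial part, zero-inflation probability `pi`); the paper's own parameterization may differ. `bootstrap_pvalue` illustrates a parametric bootstrap test in a deliberately simplified setting: the null model is a plain Poisson without random effects, and `zero_excess` is a hypothetical stand-in statistic, not the score statistic derived in the paper. All function names are illustrative.

```python
import math
import random


def zinb_pmf(y, mu, alpha, pi):
    """P(Y = y) under an assumed ZINB parameterization (requires mu > 0).

    alpha = 0 gives the zero-inflated Poisson; alpha = 0 and pi = 0 give
    the plain Poisson, which is the simultaneous null hypothesis.
    """
    if alpha > 0:
        r = 1.0 / alpha  # negative binomial size parameter
        log_count = (math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
                     + r * math.log(r / (r + mu))
                     + y * math.log(mu / (r + mu)))
    else:
        # Poisson limit of the negative binomial as alpha -> 0
        log_count = -mu + y * math.log(mu) - math.lgamma(y + 1)
    return pi * (y == 0) + (1.0 - pi) * math.exp(log_count)


def rpois(mu, rng):
    """Draw one Poisson(mu) variate with Knuth's method (fine for small mu)."""
    limit = math.exp(-mu)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


def zero_excess(y):
    """Stand-in statistic: observed zero fraction minus the Poisson-implied one."""
    mean = sum(y) / len(y)
    return sum(v == 0 for v in y) / len(y) - math.exp(-mean)


def bootstrap_pvalue(y, stat, n_boot=500, seed=0):
    """Parametric bootstrap p-value under a simplified Poisson null."""
    rng = random.Random(seed)
    mu_hat = sum(y) / len(y)  # fit the null model (Poisson MLE)
    t_obs = stat(y)
    exceed = 0
    for _ in range(n_boot):
        # simulate a data set of the same size from the fitted null model
        y_star = [rpois(mu_hat, rng) for _ in y]
        if stat(y_star) >= t_obs:
            exceed += 1
    # add-one correction keeps the p-value strictly positive
    return (1 + exceed) / (n_boot + 1)
```

In the two-level setting described above, the simulation step would instead draw cluster-level random effects and counts from the fitted null mixed model, and `stat` would be the relevant score statistic; the resampling logic stays the same.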

Similar resources

Score Tests for Zero-inflation and Over-dispersion in Generalized Linear Models

Discrete data in the form of counts often exhibit extra variation that cannot be explained by a simple model, such as the binomial or the Poisson. Also, these data sometimes show more zero counts than what can be predicted by a simple model. Therefore, a discrete generalized linear model (Poisson or binomial) may fail to fit a set of discrete data either because of zero-inflation, because of ov...

Zero-inflated generalized Poisson models with regression effects on the mean, dispersion and zero-inflation level applied to patent outsourcing rates

This paper focuses on an extension of zero-inflated generalized Poisson (ZIGP) regression models for count data. We discuss generalized Poisson (GP) models where dispersion is modelled by an additional model parameter. Moreover, zero-inflated models in which overdispersion is assumed to be caused by an excessive number of zeros are discussed. In addition to ZIGP regression introduced by Famoye ...

Tests for zero-inflation and overdispersion: A new approach based on the stochastic convex order

A new methodology to detect zero-inflation and overdispersion is proposed, based on the comparison of the expected sample extremes among convexly ordered distributions. The method is very flexible and includes tests for the proportion of structural zeros in zero-inflated models, tests to distinguish between two ordered parametric families and a new general test to detect overdispersion. The per...

Using observation-level random effects to model overdispersion in count data in ecology and evolution

Overdispersion is common in models of count data in ecology and evolutionary biology, and can occur due to missing covariates, non-independent (aggregated) data, or an excess frequency of zeroes (zero-inflation). Accounting for overdispersion in such models is vital, as failing to do so can lead to biased parameter estimates, and false conclusions regarding hypotheses of interest. Observation-l...

Estimation of Count Data using Bivariate Negative Binomial Regression Models

Abstract Negative binomial regression model (NBR) is a popular approach for modeling overdispersed count data with covariates. Several parameterizations have been performed for NBR, and the two well-known models, negative binomial-1 regression model (NBR-1) and negative binomial-2 regression model (NBR-2), have been applied. Another parameterization of NBR is negative binomial-P regression mode...

Journal:
  • Computational Statistics & Data Analysis

Volume 61


Publication date: 2013